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Matches detected during the first 20 forward iterations O 
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Results for gi|1788084|^>|AAa4854.1| 

There were no matches to Pf am-A (including borderiine matches) for giI1788084lgblAAC748S4.1 1 

Matches to Pfam-B 



Domain Start |End | Evahie Alignment! 
[Pfam-B 39416 p33~ S^HlDgT iATrj^ j 



j "j [477 xesi&us] 



Alignments of Pfam-B domains to best-matching Pfam-B sequence 

Format for fetching alignments to Pfam-B families: [Hypertext linked to SWISSpfam □ ( 
Query gill 788084 Igb I AAC74854.1 1/233-423 matching Pfam-B 39416 

YEAHJECOLI 233 DLRYKNYEKRPDP S S Q AVHF CLMD V S G SMB Q STXDMRKRFYILLYLFI. 5R 282 
. M( , ftftft6j , 1 „, M . DLRYKNVEKRPDP S S QAVtIF CLMD V S G SHD Q STl^Mi^pyiLLYLFL SR 

gil 1788084 |gb|AAC 74854. 1| 233 DLRYKN YEKRPDP S S Q AVMF CLMD9 s g smd q stkdm akkp vxll vlfl SR 282 

YEAH_EC0LI 283 TYKJWE W YIRHHT QAKEVDEHEFFY S QBT G GTIV S S AOXMDE WKERY 332 
TYKNVEV VYIRHKT QAKEVDEHEFF YS QET G GTIV S S ALXLWDEV VKERY 
gi| 1788084) gb | AAC 74854. 1 1 283 TYKJWEVVYIPJWT QAKEVDEHEFFY S QET G GTIV S S ALXLMDE W KERY 332 

YEAK_EC0LI 333 NPAQWHIYAAQ ASD 6DHWADD SPX CHEILAXXLLP V VRYY 5 YIE JTKRAH 382 
.^<v ftAO >. _ . . _ - . _ HP AQWHIY AAQASD GDNWADD SPL CHEIL AKKLLP VVRYY 5 YIEITRRAK 

gx|1788084|gb|AAC74854.1| 333 HP A QWHIY AAQASD GDHWADD SPJL CHEILAKKLLP V VRYY S YIZITRRAH 382 

YEAH_ECOLI 383 QTLWREYEHL QSTFDHF AM QHTRD QM)I YRVFREXFHX.QMA 423 
QTLWREYEKLQ STFI)HFAMQHrRDQDDIYPVFRELFKK.QHA 
gi|1788084fgb|AAC74854.1| 383 QTLWREYEKLQSTFDHFAMQKIRDQDDIYPVFRELFHKQHA 423 



Align to family 



If you think there is anything wrong with this script, please, contact Pfam 
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LOCUS 
DEPINITIOH 
ACCESSION 
PIB 

VERSION 
BB SOURCE 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOVRHAL 
MEDLINE 
PVBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



AAC74854 427 aa BCT 01-BEC-2UUU 

orf, hypothetical protean [Escherichia coli M21. 
AAC74854 
gl 788084 

AAC74854. 1 01: 1788084 
locus AE000273 accession AE000273. 1 

Escherichia coli K.12. 
Escherichia coli K12 

Bacteria; Proteohacteria; gamma subdivision; Enterobac teriaceae; 
Escherichia. 

1 (residues 1 to 427> 

Blattner,F.R. , Plunkett, g. in, Bloch, C.A., Pema,N.T., Burland,v., 
Riley, M., Collado-vides, j. Glasner, J. B*. , R©de,C.K., Mayhew, G.F. . 
Gregory., Davis, N.W., Kirkpatrick, H. A. , Goeden,M.A. , Roseau. J., 
w au / B . and Shao^. 

The complete genome sequence of Escherichia coli K.-12 
Science 277 (5331), 1453-3474 (1997) 
97426617 
9278503 

2 (residues 1 to 427> 
Blattner, F . R- 
Birect submission 

Submitted (16-JAN-1997) Gup Plunkett in. Laboratory of Genetics, 
University of Wisconsin, 445 Henry Mall, Madison, VI 53706, USA. 
Email: ecoii0sfenetics.visc.edu Phone: 608-262-2534 Fax: 
608-263-7459 

3 (residues 1 to 427) 
Blattner,F.R. 
Birect Submission 

Submitted (02-SEP-1997) cup Plunkett III, Laboratory o£ Oenetics, 
university of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA. 
Email: ecolidgenetics.wisc.edu Phone: 608-262-2534 Fax: 
608-263-7459 

4 (residues 1 to 427) 
Plunkett, c. HI. 
Birect submission 

Submitted (13-0CT-1998) Laboratory of Genetics, University of 
Wisconsin, 445 Henry Mall, Madison, WI 53706, USA 
comment This sequence was determined by the E. coli Genome Project at the 

university of Wisconsin-Madison (Frederick R. Blattner, director). 
Supported by HIM grants HG00301 and HG03428 (from the Human Genome 
Project and NCHGR> . The entire sequence was independently 
determined from E. coli K12 strain MG1655. Predicted open reading 
frames were determined using GeneMark software, kindly supplied hy 
Mark Borodovsky, Georgia Institute of Technology, Atlanta, ga, 
30332 [e-mail: markSamber.gatech.edu). Open reading frames that 
have been correlated with genetic loci are being annotated with CG 
Site Nos., unique IB nos. for the genes in the E. coli Genetic 
Stock center (cgsc) database at vaie university, kindly supplied by 
Mary Berlyn. A public version of the database is accessible 
(http://cgsc.biology.yale.edu>. Annotation of the genome is an 
ongoing task whose goal is to make the genome sequence more useful 
by correlating xt with other data. Comments to the authors are 
appreciated. Updated information will be available at the E. coii 
Genome Project's World wide Web site 

(http://www.genetics.wisc.edu). The E. coii Kl2 sequence and 

its annotations are periodically updated; this is version M54. no 
sequence changes. Annotation updates: updated gene identifications 
and products; all new functional assignments courtesy of Monica 
Riley; added promoters, protein binding sites, and repeated 
sequences described in reference 1. The unique numeric identifiers 
beginning with a lowercase 'b* assigned to each gene (protein- ©r 
RHA- encoding) are now designated as gene synonyms instead of 
labels. This should allow them to he searched for in Entrez as gene 
name s . 

Method: conceptual translation. 
features Location/Qualif iers 

source 1..427 

/organism= M Escherichia coli K.32" 

/strain=* M JU2" 

/ sub_s tr ain= -« G 1 6 55 " 

/db_xref « M tax on: 83333" 
Protein 1. .427 

/ func tion= "orf; Unknown" 

/products- orf hypothetical protein- 
CBS 1..427 

/genes-yeaH- 

/coded_by=-1788078:6385. .7668* 
/ trans l__tab le = 11 

/note=-o427; This 427 aa ORF is 28 pet identical (43 gaps) 
to 327 residues of an approx. 332 aa protein VZBC bacsu 
SW: P45742- ~ 
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22 kdeyldiife dlalpnlkqn qqrqlteykt hragytangv paniswrsl qiislarrtam 

* t ^q KCTfc ' Lfta leenlaiisn sepaqlleee rlrkeiaelr akiervpfid tfdlryknye 
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Select Database 



Release: 
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Apply Filters 



• Iteration Filler. PSI- BLAST matches to be excluded: 



None 



If you select e.g. "Matches detected during the first 3 iterations" these matches will be excluded from the jj 
report (using the first_PBjter annotation). This allows you to focus on more remote homologous which have W. 
been detected alter 4 or more PSI-BLAST iterations. Matches detected using PSI- BLAST with negative m 
iterations or using Genome-Threader are not effected by this option. However, if one match is found during p| 
the first e.g. 3 PSI- BLAST iterations and by Genome-Threader it will be excluded. |r- 
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l . '^^^ Bookmarks " Jfc Location: j http: //www, sanger. ac. uk/cgi-bin/Pfam/nph-search.cgi 



;3> : Help 



MP 



, c The Pfam 

I O anger Protein families database of alignments and HMMs 

V^eDire Home I Keyword search! Protein searci} | DATA search | Browse Pfam | Taxonon^ searcbf Help 




Results for 0 |2367274|gfr|AAC76768,l| 

There were bo matches to Pfam- A (including b oiderhne matches) for gil2367274 1 ^bl AAC76768 .1 1 

Matches to Pf«m-B 



i Domain jStart 


EndjEvatae jAEpmcnt] 


!P(am-B 15204 204 


408 j2.4e-108|Mga | 



t 



Alignments of Pfam-B domains to best-matching Pfam-B sequence 
Fonnatforfetctagalig^^ | Hypertext linked to SWSSpf am □ \ 



Query gil2367274IgblAAC76768.1li204-408 matching Pfam-B 15204 

YIZMJECOLI 
gil23S?274|gb|WlC7S7S8.1| 
TIMJ&COLI 

fXEHJECQLI 
gi| 23$?274| gbl.aAC757S8.il 

raM_rcoii 
yi| 23672741 gb|AftC75768.3| 

YIMJECOtI 
gi|23S72?4|gb|A*c7S?68.1| 



204 DIlJa.U>PmTL6imEm?mVEKQLLT?W^6ESWFEKraWV 253 

DILP1LPPE1ATL CIT2LE7r7YRRLVZK.QLLTYRLH6E SVFXmrPF V 
204 DILPJJJ»PZLML6ITELmrYmV»QLLTYPXHCESWPXK?IEW>V 253 

2S4 VHKDVDE QPRCPFIV CVDT 5 C 5WGGPHI QCWA?CLM.MRIW.ArHRP.C7 303 

VKKDYDEQPR6PFIV CVDT 5 CSBG CFHEQCJ^CUUJJBIJ&WmWC? 
254 CPR5PFI VCVDT 5 5 SMG6?f}EQCJOWCIiUifKXillAER3tRC? 303 

304 IHLF STEIVRm 5 GP Q CIZ Q AIPOTt SQQFRC &TDLA5CFPAIHERLtl 5* 353 

IW1^5invn?IX5CPQClEQAim50Q7^ecrDtA5CFPJaMEPX05n 
3M IHLPSTEIVP^rLSCPqCrEQAIPJLSQQFRGGTDUlSCPWItirPXQS^ 353 

354 EOTB ADAVVI SDFXftQRLPDDVT SOT KEL QRVHQHRFHAV JIM S AK6KP 61 403 

rVTDmVVISDFIAQPXPDDVTSKVmQBVMQHPJHRVAaSAKCKPGI 
354 EOTDADAraSDmQiaPnD?TSK?mgKVKQHRFKftViltf5flKGTPGr 403 

404 HRIPD 408 

mm 

404 BKtTB 408 



Align to family j 



If you think theieis anything vrong with this script please contact Pfam 
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Bookmarks S Lde^ti^n: 



LOCVS 

pz rmm w 

ACCESSION 
FID 

VERSION 
RESOURCE 
KEYWORDS 
SOURCE 

ORGANISM 



01-DEC-2000 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
MEDLINE 
PUB MED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



Protein 



CDS 



ORIGIN 



AAC76768 427 a« BCT 

orf * ^VP 0 ***** 0 ** * rotei * Escherichia coli K12J. 
AAC76768 
g2367274 

AAC76768. 1 CI: 2387274 

Locus AE 0004 51 accession AE 00 04 5 3 . 1 

Escherichia coli XI 2. 
Escherichia coli 

Esch"ichiI r0tCOb * Ct€ri * ; ***** s^*^* 51 ™; Knterobacteriaceae; 

1 (residues 1 to 427) 

Blattner„F,R. , Plunkett, G. TTT Bloch C A x>^m^ u <r n».i-i^,f w 

^i^ r -^d 5E2V- ,r - * . ™Z» *:, ' 

3278SP~ 

2 (residues 1 to 427> 
Blattner,F.R. 
Direct submission 

submitted (26-JAN-1S37) cup Plunk ett Hi, Laboratory of Genetics 
university of Visconsin. 445 Henry Wall/Madison, VX 53706 USA 
S^W-vJm***** tiCS Visc * du 608-262-2534 Fax:' 

3 (residues 1 to 427) 
Blattner,F.R. 
Direct submission 

submitted (02-SEP-193?) cup Plunkett in. Laboratory of Genetics 
University of Wisconsin, 445 Henry Mall/Madison, VI 53706/ USA 
608^263-7459 to * 11 * txcs.wise.edu Phone: 608-262 -2534 Fax: ' 

4 (residues 1 to 427) 

PlUnkctt # 6. TTT 

Direct Submission 

submitted (33-OCT-1998) Laboratory of Cenetios University *f 
Wisconsin, 445 Nenry Ball, Madison, VI 53706* USA ^ 
This sequence was determined by the E. ooli Cenome Project at the 
Universito of Visconsin-Madison (Frederick R. Blattne? director? 
Supported by NTH pants HG00301 and HC03428 (f rem the Huma£ Senome 

5nZ»teS*«7S7 m J3t* Si^t "~ independent!? 

determined from E. colx K12 strain HC1655. Predicted open reading 
frames were determined using ceneMarx software, kindly supplied bu 
™£* B ? rod °^ kp ' e J2 rfl £* »*tAt«t€ of Technology, Atlan^ CA P 
30332 [e-mail: markftamber . gatech. eduj . Open reading frames that 
have been correlated with genetic loci a« being iSotlted with CG 
Site Nos. , unique id nos. for the genes in the £ ^oii e!*»Ii, 
5 toe* center (Ssc> database a£* J JruniC?rsityf kindly s^piied by 
Mary Berlyn. A public version of the database is accessible* P 
(http://c gsc.biology.yale.edu). Annotation of the genome is an 
on ^°^ **** «*Jf * if make the genome secuencTmort Seful 

by correlating it with other data. Comments to the authors I™ 
appreciated, updated information will be available at u£ I coli 
cenome Project's Vorld Vide Veb site wie r. coli 

^ttp: //www. genetics.wisc.edu). *** The E. coli JU2 sequence and 
its annotations are periodically updated; this^iT vers ion M§4 No 
an3To?o^tr g !!i SSi 0 ^^ «P**tes: updated gene iolntif Icilions 
and Products; all new functional assignments courtesy of Bonica 
Riley; added promoters, protein binding sites, and repeated 
sequences described an reference a. The unique numeric identifiers 

^S^ln^r*^*??? *K *T?*ch gene (pro of 

RRA- encoding) are now designated as gene synonyms instead of 
labels. This should allow them to be Searched for in EntreS as gene 

oonoeptual translation. 
Location/ qualifiers 
3.. 427 

/organism- "Escherichia coli R12" 
/strain=*JU2" 
/sub_s train* "MCI 655" 
/db_xrefa- tax on: 83333" 
1..427 

/function^" orf ; Unknown* 
/PEodnot»"orf, hypothetioal protein" 

/gene**yieM" 

/coded _bp=* complement (2367272: 5249. . 6532)" 
/translatable =11. 

witer^rsion^***' CW9€ joinS °** S pUj> <rom 



Method: 



»fS*^??f? ??J^w5 Pqqsgllstp qfivqlpqil dllhrinspw aegarqlvda 
if? «i2f?fi5f Avqattinqq Ueeeregll sevqe rmtls gqleplladn 

J^?^^ ^i^?^? 1 **<n>*l*rla eqlgrsreak iiprndlgme 
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Database 



IE 



^protein. JtfV. 1 , t \ ^ " , jm 
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Re -Edk View 


:^Gqr -Communicator ^ J* 


r ■• % • j+ ■■■■■■■ • JV.v vi ... • 


rite. 






wte ^ Location:) http: //vi 


ctoria. inpharmatica.co.uk/-~volker/BPD3target.html 








Target Mining 
Interface 



iri»nhatica 



Select Your Query Sequence 



■ Enter PDB accession number (e.g. 1 QMA): 



OR 



5 JLM md chain (e.g. B): 



• Enter one Swiss- Prot accession (e.g. P27504) or Gen Bank proteinID (e.g. CAB08761 .1 ): ] 



Select Database 



Release: 



DEVF9 = BPD3 D 



Apply Filters 



• Iteration Filter. PSI- BLAST matches to be excluded: 



None 



If you select e.g. "Matches detected during the first 3 iterations" these matches will be excluded from the report 
(using the first_PBjter annotation). This allows you to focus on more remote homologous which have been 
detected after 4 or more PSI-BLAST iterations. Matches detected using PSI-BLAST with negative iterations or 
using Genome-Threader are not effected by this option. However, if one match is found during the first e.g 3 
PSI-BLAST iterations and by Genome-Threader it will be excluded. 

• Rlter for the following SPECIES: 

| DHomo sapiens 
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-Tile .^E^^^^fjZ^Sll^^^^^^^L^ 'a'' * " * i- * ' j** * ' ' ' ^ " * * ^ < \ « 'Help 

' B^c^ > ' Forward :,, RfeloadyyJ^Hbme^'fi ^ea^^Net^cajfe'-^ KRririifc;": : | ; Se^n^;^;|tte>fe^ ' H v.K' i"^^. 



I . ^ ookmar ^ Lopatidn:/]lhttp: //London-bridge. tnpharmat?c^.co.uk/cQi43irWolker/getf argetBPD3.pt 



Br 



2) 81 additional hits identified by both, Genome Threader and PS1-BLAST: 

Combined Ginonu Thread* r&ndPSI -Blart output: PSI- BLAST values m* ihown in maroon! 



Add2iist BPD links WWW link 




Organism 



%1D 
(GT,PSi) 



Query 
rcjn. 
(GT,PSI) 



Target rgn. 
(GT,P8I) 





1st 


Conf.(GT) 


Iter. 




(PSI 



: I - 



\ AAA59544.1 | 
I drill through I 
|Top50BlMlrik!MA5W41L 



j fted.Seq.Vfcy 



Not given 



Homo 
sapiens 



PR! f 



H00%. 100% 



junmaskedSW 



1-187, 
1-187 



148-334, 
148-334 



488 



100% 
umnaskedGT 



1 AAB24B2K1. | 
i drill through I 
lTop50BtotHiblAAg2^iLi 



leukocyte integrin alpha chain 



| Red.Seq.Vlew \ 



Homo 
sapiens 



[100%, 100% 



PRI 



iunroaskedSW 



1-187, 
1-187 



148-334, 
148-334 f 



100% 
Unmasked GT 



Q99715 | 
drjl through I 
jTopSOBIastHts i QW?iS 



COLLAGEN ALPHA 1 (XII) 
CHAIN PRECURSOR. 



Red.SegMw ! 



Homo 
sapiens 
(Human). 



PRI 128.9%. 28% 
m bmaskedSW 



2-186, 
2-179 



439-617, 
2322-2494 



456 



100% 
unmaskedGT 



l AAB38702J | 
; drm through I 
jTopSOBtotHtts jMBMgU 

I Red.Seq.Vlgy I 



car Sage matrix protein 



Homo 
sapiens 



pm 1 26.9%, 25% 
rm kinmaskedSW 



2-186, 
2-186 



274-452, 
40-221 



446 



100% 
•unmaskedGT 



drill through 
fopSOBtastffits AAC0I506I 



AAC01506J 



Red.SeciMew 



CM72402J 



drfll through 
; Top50BlastHrh 



Red,Seq.\*e\y 



type XII coBagen 



Homo 
sapiens 



CAA72402.1 



collagen type XIV 



Homo 
sapiens 



dpi j28.4%. 28% 
rw lunmaskedSW 



2-186, 
2-186 



137-318, j 
137-318 i TO 



103% 
UnmaskedGT 



PRI 



|287%, 30% 
l unmaskedSW 



2-186, 
2-188 



6-185, 
6-185 



442 



100% 

flinmaskedCT 



j AAB3B547.1 
I drill through 
i TopSOBlastHits 



AAB38347.1 



Red.SeqAfev 



ieukolntegrin alpha d chain 



Homo 

sapiens 



PRI 



}61%,60% 
hinmaskedSW 



1-187, j 148-334, 
1-187 } 148-334 



439 



100% 
junroaskedGT 



CAB71222.I 
drffl through 
TooSOBlaslNls 



CAB7 1222.1 



dJ238015.l (collagen, type Xtl, 
alpha t) 



Homo 
sapiens 



ppi 127.1%. 22% 
rni lunmaskedSW 



| 

M86, 1 293-472, 
2-186 11431-1620 



439 



Red.Seq.Vtev 



100% 
unmaskedGT 



CAA07569.1 
drfl through 
TopSOBIastHts 



CAA07569.1 



matron -4 



Homo 
sapiens 



PRI 



27.9%. 24% 
unmaskedSW 



2- 186, i 344-525, 

3- 187 ! 34-215 



418 



100% 
unmaskedGT 



Red.SeqAfev 



CAB46380.1 



j drB through 
jTopSOBIastHrt) 



CAB4638D.I 



dJ453C12.3(matrfn-4) 



Homo 
sapiens 



; PRI 



27.9%. 24% 
unmaskedSW 



2- 186, 

3- 186 



lRed.Seq.Viw 



385-566, 
34-214 



418 



100% 
lunmaskedGT 
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File; f Edit View ^ Go Communicator % - „ \ * . \ " ^ f ^ - 

T- -W7?.\'£r~ V;-;;'-''^vr : ' ^ Aval ' ~ ~ - . % ~ a .^L ~ V ~r\*v" "-'v , " 



;Bacfr ^Forward Reload * Home > v Search Netscape!,; ."Print •■ -ft* gecuri^V;>r ^to^" ^ / ^ H.U V^'-r* 



^'^^-S-Pop^Q^"' vii^^Locatioji: |?Kttp: //www. Sanger, ac. uk/cgHbin/Pfam/nph-search.cgi 



, c The Pfam 

i kflHger Protein families database of alignments and HMMs 

^Cnire Home | Keyword searchl Protein seardi | DNA search I Browse Pfam | Taxonomy search | Help 




Results for giJ133251|sp)P10155|RO60_HUMAN 

There were no matches to Pfam- A (mcradmg borderline matches) for gill 33251 1 sp IP1 01 55 DRO60 JH UMAN 

Matches to Pfam-B 

( Domain [Start fEad [Evatue =AKgranenl | 



jPfam-B^344 jt 1194 |2.3e-103 jAhgn 
i Pfam-B 1 oT^ ll5s 15^ jl.8e-165 jAlig n 



I 



] f538 residues) 



Alignments of Pfam-B domains to best-matching Pfam-B sequence 

Format for fetching alignments to Pfam-B families: | Hypertext linked to SWiSSpf am □ ] 



Query gill 33251 1 sp!H 01 55 IRO60 JIUMANrt - 

Q32787 1 

gi|I332S3|sp|PlO355|RO£0_HVMAH 3 
Q9278? 51 

gx\ 133251 1 sp t PI 0155 IROGO^HUMAH 51 
092787 101 

gx\ 1332511 sp|Pl0155|R060_HUMAH 103 
092787 153 

gi|133251|spjPl0155tRO60_HUMW 151 



194 matching Pfam-B 8344 



MEESVHQWQPLHrKQI«NSQDGWWQVTDMHRLMRPLCFCSECGTYYIKB 50 
MEE S VH QM QPLBEKjQI AH 5 QD GVVWQVTDMHRLHRFL CFGSEG CTYVIKE 
MEE SVn QM QPEKEKQI AN S QDGY WQVTDMHIU.HRFL CPGSEG CTYVIKE 50 

QW.Gi&WEjaxsatti>QKeczvxq[^^ 100 

QKE CEERftE APPLIED GAG CEVXQEXK5F S QE GPTTXQEPMLF AL AI C 5 
QKX CLEHAEALIPXIED CRD CEVI QEIXSF S QE GRTTKQEPMEF AL AI C S 100 

Q C SI>I STF.Q AAFKAV SEV CPOCPTMLFTFI QPXKDLFE SHKC GMWGRAERK 150 

Q C SDI STKQ AAFKRV SEV CRIPTHLFTPI QFKKDLKE SMKC GMVGRALRK 

Q C SDI STX.Q AAF KAV SEV CRIPTJflFTFI QP KKDjLKE SMKC GMVGRAERK. 150 

AI ABWYHEKG GMALALiiVT3»^QK» GW5mO)ELRL SHLXP S5EG 194 
AXAIXWYREF.G G» AIJU. AVTXYFvQRH GV SMKDLLPX SMLXP S SE C 
AIJlWYHEKGCM^AEAVT^YKQPOIW^ 194 



| Align to family 1 1 



Query gill33251lsplP10155IRO60_HUMAhW95-538 matching Pfam-B 10162 



008848 395 LAIVrayiTXGVXEVMEEVXEXAl. SVEAEKLLKYLEAVEKUKPJKDDI^V 244 
LAXUTKYITKGVKEVKE VKEKALSVE EKIXKVEEAVEKVKRTKIH-LE7 
gi| 133251| sp | Pl0155lRO€0_HUaAM 195 LAI VT1OTTKGW1SE VKELrKEKAl. SVETEKLEKtfI»EAVEFvVXRTKDEI,EV 244 

008848 245 I KLX E EKQ L VRE KLi THKLK.SKE VWKALL QEMPLTALLKRL GKMTAH SUX 294 
TW.THF.M+I.WT^Hia-TnMLXSKE\7WXJa.L OEWPLT ALLAHL GKMTAA SVL. 



008848 295 EP GH SEV SEI CEKE 5HEKIXlQ<APa:KPFltVI^ ALF^PsAGHGLRGKI^l^ 344 
EP6MSEWSL+CEKL KEFJJJOtfUU3a>FH+I~IAI.ET?+ GHGLRGKLFCT 
giJ133251|sp|Pl0155|AOS0_HUllAH 295 EP CM SEV SEV CBKLCHEKLLKKHRXMPFHOJ ftUTYKT 6KGLRGKL3BWR 344 

_ ____ 008848 345 PDKDXL QAEDAAFyTTFKXVEPT GKRPEEAVD V SASMH QKAE G S VEHAST 394 

.- •> ,y£ ^e^-Tr^^y " ? -.Tyj^^ r ^g^ZLggg^v^^ 
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to CVS 
DEFIMITIOM 

ACCESSION 
PIP 

VERSION 
DBSOURCE 



KEYWORDS 



SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
WEDLIMK 
REMARK 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLIHE 
REMARK 
COMMENT 



FEATURES 

souroe 



Pro tein 
Region 



Region 



Region 



Re gion 



ORIGIN 



ROG0_HUMAN 538 aa PRI 01-FEB^1996 

60 KD RO PROTEIN (60 KD RIBOHUCLEOPROTEIN RO) (RORNP) < SJOGREN 

SYNDROME TYPE A ANTIGEN (S5-A)). 

P1015S 

^133251 

P101S5 01:133251 

swissprot: locus RO60_HUMAN, Accession P1015S; 
class; standard, 
created: Mar 1, 1389. 
sequence updated: Mar 1, 1989. 
annotation updated: Feb 1, 3 996. 

xrefs : gi; gi: 177732. gi: gi : 377783. gi: gi: 387656. gi; gi: 
3876S7. gi: gi: 86722. gi: gi: 307626 

xrefs (non-sequence databases) : WIM 600063. MIM 234700. pROSITE 
PS00030 

Ribonucleoprotein; RNA-binding; systemic lupus erythematosus; 

Antigen. 

bum an. 

Homo s at> i ens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 
Eutheria; Primates; Catarrhini; Kominidae; Komo. 

1 (residues 1 to 538) 

Deutscher, S.L. , Harley,J.B. and Keene,J.D. 

Molecular analysis of the 60-kDa human Ro riionucleoprotein 
Proc. Natl. Acad. Sci. U.S.A. 85 (24), 9479-9483 (1988) 
89071722 

SEQUENCE FROM N.A 

2 (residues 1 to 538) 

Ben-Chetrit,E. , Gandy,B. J. , Tan, E . M . and Sullivan, K. F . 

isolation and characterization of a cDha clone encoding the 60-kD 

component of the human S S -fl/Ro ribonucleoprotein auto antigen 

J. Clin, invest. 83 (4), 1284-1292 (1989) 

89398084 

SEQUENCE FROM N.A. 

This swiss— prot entry is copyright. It is produced through a 
collaboration between the svr±ss Institute of Bio informatics and 
the embe outstation - the European Bioinf o rata tics Institute. 
The original entry is available f rom http://www. expasy . ch/sprot 
and http://www.ebi.ac.u3c/sprot 

(FUNCTION] UNKNOWN. 

I SUBUNIT ] RO SMALL RIBONUCLE OPR0TEINS CONSIST OF FOUR SMALL. RNA 
MOLECULES OF 85-112 NT, EACH OF WHICH IS COMPLEXES WITK A 60 KD 
PROTEIN. RO RNPS WAV ALSO CONTAIN AN ADDITIONAL 52 KB PROTEIN. 

[SUBCELLULAR LOCATION) CYTOPLASMIC. 

[DISEASE ] SERA FROM PATIENTS WITK SYSTEMIC LUPUS ERYTHEMATOSUS 
OFTEN CONTAIN ANTIBODIES THAT REACT WITH THE NORMAL CELLULAR RO 
PROTEIN AS IF THESE ANTIGEN WAS FOREIGN. 

I SIMILARITY ) CONTAINS 1 RNA RECOGNITION MOTIF (RNP) . 
£ SIMILARITY J STRONG, TO XSHO^VS 60 KD RO PROTEIN. 
L o c at ion/ Qualif ie r s 
1. .538 

/organism="Homo sapiens** 
/db_xref=-taxon i 9606- 
1. . 538 
1 . . 538 

/products" 60 KD RO PROTEIN* 
93. .98 

/ re gion^name = "Domain" 

/no te = " RNA-BIHDIN 6 (RNP 2) (BY SIMILARITY) . »' 
124.. 131 

/ r e gion_name = "d omain" 

/no te = w RNA-BINDING (RNPl) (BY SIMILARITY) . - 

/ re gion_name =" c onf lie t" 
/note= w K -> R (IN REF. 2) . - 
515.. 538 

/ re gion_name= w c onf lie t M 

/ no te = M GMLD M C GFDT G ALDVIRNFTLDMI - > AL QNTLLNKSF (IN REF. 
2)." 



1 meesvnqmqp Inek qiansq dgpvwqvtdm nrlhrf lef g seggtyyike qklglenaea 

61 lirliedgrg ceviqeiksf sqegrttkqe pmlfalaics qcs distkqa afltavsevcr 

121 ipthlftfiq fkkdlkesmk cgmwgralrx aiadwyxvekg gmalalavt3c yiqmgwshk 

181 dllrlshlkp sseglaivtk yitkgwkevh elykekalsv etekllkyle avekvkrtkd 

241 elevihliee hrlvrehllt nhUcskewJc allqemplta llrnlgkmta nsvlepgnse 

301 vslvceklcn eJclUtkarih pfhiliaiet yktghglrgk Jkwrpdeeil kaldaarpkt 

361 fktveptgkr fllavdvsas mnqrvlgsil nastvaaamc mwtrtekds ywafsdemv 

421 pcpvttdmtl qqvlmantsqi paggtdcslp miwaqktntp advf ivftdn etf aggvhpa 

481 ialreyrkkm dip ak live g mtsngrftiad pddrgmldme gfd tgaldvi rnf tldml 
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